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Abstract 
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PACS numbers: 02.50.-r,89.75.Da, 05.10.-a,95.75.Pq,95.75. Wx 
Keywords: Long-Range Dependence, Scaling Phenomena 



*Electronic address: richard@richardclegg.org 
^Electronic address: mmdl@york.ac.uk 



1 



I. INTRODUCTION 



Long-range dependence (LRD) is a statistical phenomenon which is used to describe a 
process which exhibits significant correlations even between widely separated points. A more 
formal definition is given in the next part of this paper. Roughly speaking, a process with a 
high degree of LRD can be thought of as correlated at all scales. A good introduction to the 
topic of LRD is provided by |l| and a discussion in the context of telecommunications traffic 
is given by 3]. LRD is most often characterised by the Hurst parameter, H, which is in the 
range (1/2, 1) for a time series which exhibits LRD. If H = 1/2 then this indicates the data 
is independent or has only short-range correlations. The topic of LRD has attracted a great 
deal of interest since LRD has been observed in time series measured in fields as diverse as 
finance, internet traffic and hydrology. 

This paper introduces and tests a mechanism for generating LRD based on an infinite 
Markov chain. The traffic stream generated is binary in nature and the model has only 
two parameters, the mean and the Hurst parameter of the generated traffic. This section 
provides a brief introduction to the topic of LRD in the context of telecommunications 
networks and discusses currently used generation mechanisms for modelling LRD and also 
methods which are currently used to measure LRD in a time series. Section |H] describes 
the infinite Markov model. In Section IIHI it is proved that the model does in fact generate 
a time series with a given mean and with LRD having a given Hurst parameter. Finally, 
Section HXl tests the model against other standard LRD generation models and discusses the 
advantages of the model. 



A. A Brief Introduction to LRD 



A number of different (and not necessarily equivalent) definitions of LRD are in use in 
the literature. A commonly used definition is the one given here. 

Definition 1. A weakly stationary time series exhibits LRD if the absolute value of its 
autocorrelation function (ACF) p(k) does not have a finite sum. That is, 

oo 

£ ip(*)i = °°- 

k=—oo 
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It is often assumed that the ACF has the specific asymptotic form, 



p(k) ~ c p k~ a , (1) 

for some positive constant c p and some real a G (0, 1). Note that this is equivalent to a 
functional form for the spectral density /(A) defined by, 

9 OO 
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fc=— oo 

Equation (JTJ) is equivalent to, 

/(A)~ C/ |A|-^, 

as A — > 0, where a 2 is the variance, c/ is some positive constant and /3 G (0, 1). The Hurst 
parameter is then given byif=(l + /3)/2. 

It should be noted that here, and throughout this paper, f(x) ~ <? (x) is used to mean 
f(x)/g(x) — > 1 as x — > oo — sometimes, in the literature, the symbol is used to mean 
asymptotically proportional to or f(x)/g(x) — *> k for some constant as x — > oo. 

The constant a in (£[]) is sometimes expressed in terms of the Hurst parameter H = 1— a/2. 
The Hurst parameter as defined by this relation and (0) is the most commonly used measure 
of LRD in the telecommuncations literature. 

The reason for the interest in the subiect within the field of telecommunications is the 
fact that LRD has beea o te ve d ia ™ ous tim e sa.es reMed t o — tr a ffi c ^Q. 
It is widely recognised that the engineering implications of LRD on queuing performance 
can be considerable. If Internet traffic is not modelled well by independent or short-range 
dependent models then much traditional queuing theory work based upon the assumption of 
Poisson processes is no longer appropriate. Traffic which is long-range dependent in nature 
can have a queuing performance which is significantly worse than Poisson traffic. Modelling 
has shown how phase transitions can arise in computer networks lol, how and this phase 

nn 

transition can be related to LRD [3, ljJ • 

In general it has been found that a higher Hurst parameter often increases delays in 
a network, increases the probability of packet loss and affects a number of measures of 
engineering importance. In fact Erramilli et al. claims that the Hurst parameter is "...a 
dominant characteristic for a number of packet traffic engineering problems...". Some of the 
effects on queuing performance are given by HQ. However, shows that while the 
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Hurst parameter is important to queueing, the relationship is not a simple one — in some 
cases a high Hurst parameter may improve performance or have no effect. The issue of the 
scale and nature of the effect of LRD on queuing remains contentious. 



B. Current Generation Mechanisms for LRD 

A number of modelling techniques are currently used for generating traffic streams ex- 
hibiting LRD. Of these, the most commonly encountered in the telecommunications litera- 
ture are Fractional Gaussian Noise processes (FGN), Fractional Auto- Regressive Integrated 
Moving Average models (FARIMA, also refered to as ARFIMA), iterated chaotic maps and 
wavelet modelling. 

The FGN process is usually defined as increments of the Fractional Brownian Motion 
(FBM) process. An FBM process Bjj(t) is defined by: 

F[B H (t + k)-B H (t)<x] = 

X 



(2n)-h- H J exp du, 



where P [X] is a probability of an event X and H G (1/2, 1) is the Hurst parameter. This can 
be seen as a generalisation of the more common Gaussian White Noise process. A number 
of authors have described m ethods fo r g enera ti „ g FGN aad FBM Q, A aad Q. 



The FARIMA model is an obvious modification of the traditional ARIMA(p, d, q) model 
from time series analysis, allowing d G (—1/2, 1/2) instead of d G Z+. FARIMA processes 
were proposed by [r| and a description in the context of LRD can be found in l[ pages 59- 
66]. As might be expected the d parameter relates to the Hurst parameter. The relation is 
simply H = d+1/2 — note that this only produces legitimate values for H when d G (0, 1/2). 

Iterated chaotic maps which exhibit intermittency are also commonly used to generate 
time series exhibiting LRD. Given a starting value Xq G (0, 1) then a time series {x n : n G N} 
can be generated by the following map 



r -I- kzd r mi < r < d 



X 



n+1 



r i H _ r ) m 2 d < r < 1 



where d G (0, 1) and m 1 , m 2 G (3/2, 2). If x G (0, 1) then x n G (0, 1) for all n G N. If this 
time series is used to generate a binary time series {y n : n G N} by the rule yi = if x,- L < d 



0.2 0.4 0.6 0.8 1 

x(n) 



FIG. 1: Graph of a map which can be used to generate LRD. 

and jji — 1 otherwise then the series can be shown to exhibit LRD with a Hurst parameter 
given by H = (3m — 4) /(2m — 2). This map is illustrated in Figure ^ An explanation for 
the presence of LRD in this map is provided by examining the behaviour of the orbits at 
Xj near zero or one. The escape from points near zero or one is extremely slow and this 
causes long sequences of zeros or ones in the generated yi series. Pioneering work in this area 
17 ] with early applications to telecommunications being given by This mechanism 
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is particularly suited for generating data for modelling of packet networks since the ON 
(yi — 1) state can be considered to be a packet and the OFF = 0) state as an interpacket 
gap. 

In fact, the work described in [l7j relates to the Markov chain based work described in 
this paper as it approximates the chaotic map approach as a piecewise linear map which 
can, in turn, be modelled as a Markov chain with the topology described later. A number of 
other papers have used Markov chains to model linear approximations to intermittent maps 
I20L l2lL l22j]. The papers 0, Q and Q relate the piecewise linear approximations of 



intermittency maps to LRD and show how certain parameters for Markov chains give rise 
to LRD in a process arising from the chain. However, we can find no reference to papers 
which relate intermittency in general to LRD. 

A technique gaining favour in modelling (and also in measuring) LRD is wavelet anal- 
ysis. This allows the LRD hypothesis to be generalised to multifractals. LRD defines a 
single scaling behaviour for the system (which applies in the tail of the ACF) - - if this 



5 



scaling behaviour was the same at any scale then the process defined would be a monofrac- 
tal. However, if the scaling behaviour differs across scales then the process is mult if r act al. 
There is some evidence that Internet traffic exhibits different scaling behaviour at differen ; 



timescales. A general description of multifractal processes and wavelets is found in [24 1 
and a description of how wavelets can be used to create models with the same multifractal 
spectrum as a given data set can be found in j^j. 



C. Measurement Techniques for LRD 

A number of techniques are known for estimating the Hurst parameter from real data. 
There is no single technique which can be considered perfect. Computer code and analysis 
of various techniques can be found at [26]. Comparisons of measurement techniques can be 
found in j^, 13 and j^. In this paper, five techniques are used: the R/S statistic (in 
two variants), the Aggregated Variance, the Periodogram, Whittle's Local Estimator and a 
wavelet based technique. 

The R/S statistic (also known as rescaled adjusted range) is one of the oldest and best 
known techniques for estimating H. It is a time domain method which relies on considering 
the way that R/S(n) varies with n where R is the range, S is the sam ple variance and n is a 
scale (sample size) within the time series. It is discussed in detail in [sJ ] and also Q, pages 
83-87]. There are several problems with this technique which are cited in the literature. 
The estimate produced is highly sensitive to the range of scales examined. In this paper two 
versions of the estimator are used which choose the scales to investigate in different ways. 
The estimator is known to be biased and also slow to converge. It is included in this paper 
mainly for its historic importance since it has become a standard measure despite its known 
weaknesses. 

The Aggregated Variance estimator produces an estimate for the Hurst parameter by 
considering how the variance of the time series scales as the series itself is aggregated into 
blocks. Again this is a time domain technique with known weaknesses — jumps in the mean 
and slowly decaying trends in particular can be issues. A fuller description can be found in 
A page 92]. 

The periodogram is one of the oldest frequency domain based estimators and is described 
in j^jj. It involves producing an estimate for the spectral density /(A) of the time series 
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and considering the slope of this as |A| — > 0. Theoretically, for LRD, a log-log plot of the 
periodogram should have a slope of 1 — 2H close to the origin. 

Whittle's estimator j^] is a frequency domain technique which uses an approximate 
maximum likelihood estimator and an estimated functional form for the spectral density 
/(A) based upon an assumed underlying model. Here, the Local Whittle variant is used 33j] 
which is a semi-parametric version assuming a functional form for /(A) only as |A| — > 0. 

Wavelet analysis has already been mentioned as a modelling technique and has been 
used for the estimation of the Hurst parameter. In addition this has the benefit of providing 
an estimate of the multifractal spectrum of the data [3, l^]. This method is based upon 
considering the behaviour of the frequency spectrum although wavelets themselves are a 
technique to allow insight into both frequency and time-domain behaviour simultaneously. 

D. The Need for a Parsimonious and Tractable LRD Generation Method 

Given the large (and not exhaustive) list of modelling techniques already mentioned, it 
might be asked whether there is a need for another model. However, the model here is 
specifically designed to be the simplest possible computational model which produces LRD. 

Fractional Gaussian Noise and FARIMA are relatively simple to analyse from a statis- 
tical point of view (though the model described here is arguably simpler). However, these 
processes cannot easily be calculated in an ongoing manner (that is, the entire time series is 
usually generated "at once" and, having generated n points, the user must effectively start 
again to generate the n + 1th point). 

Iterated chaotic maps are computationally parsimonious but are analytically problematic 
since no closed form for the invariant density of the map described in the previous section is 
known. Therefore, it is difficult to generate traffic with a given mean using the iterated map 
method and progress theoretically is difficult. An intermittent map with a known invariant 
density is given by 34] , however, it is not known if this map would generate LRD and other 
barriers exist to computational implementation. 

The generation mechanism given here is extremely simple, theoretically sound and has 
only two parameters, the mean and the Hurst parameter. The data produced is produced 
in a stream of ones and zeros and can be simply used with simulation models of networks 
- the one representing a data packet and the zero representing an inter-packet gap. 
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II. THE MARKOV MODEL FOR LRD 



Figure El shows an infinite Markov chain which can be used to generate a time series 
exhibiting LRD. This particular chain with different transition probabilities has been studied 
by a number of authors, notably, in this context Wang ^| and Barenco and Arrowsmith 
(the latter also investigates the double sided version). The parameters fi are the transition 
probabilities for reaching a given state i from state 0. Also 7Tj is defined as the equilibrium 
probability of state i. It is clear that ^°^ fi = 1 and also that ^°^7r; = 1. More details 
and expanded versions of the proofs included here can be found in |2j, Chapter 2]. 



OFF 1 ON 




FIG. 2: An infinite Markov chain which generates a time series exhibiting LRD. 

The chain shown, given a starting state Xq G Z + , produces a Markov time series {Xi : 
i G M} where all the Xi G Z + . In turn, this chain can generate a time series {Yj : i G N} 
where Yi — if Xi — and Y t — 1 otherwise. 

It can be easily shown that the chain above is ergodic (and hence the equilibrium distri- 
bution exists) if ifi < oo and also Wi G N, 3j > i : fj > — the first condition ensures 
that the mean return time to the zero state is finite, the second ensures that any state in 
the chain can be reached from the zero state (obviously the zero state will be reached from 
any state i in exactly i steps). For the rest of this paper it will be assumed that any chain 
discussed meets these conditions for ergodicity. 

Theorem 1. The equilibrium distribution of the ith state is given by, 

oo 
j=i 

Proof. For a state i then at equilibrium, the inputs to a state will sum to 7Tj. That is, 

7Tj = T^i+l + TTofi- (2) 
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Substituting the same equation for 7r i+ i gives, 

T^i = K fi + TC f i+1 + 7T i+2 , 

and repeating this subsitution recursively gives the proof. 

Note that since Y^jLo fj = 1 t nen for 2 = this equation simply says ir 
the 7Tj must sum to one then, in addition, 

oo oo 
7To = l-^7T l = (l + ^^)- 1 , 
i=l i=l 

which, as has already been discussed, is finite. 

A. Introducing LRD into the model 

LRD with Hurst parameter H can be guaranteed if the ACF p(k) meets the condition 
given by (Q). The most obvious way to induce a correlation for a lag k into such a model is 
to choose the in such a way that unbroken sequences of k or more ones occur in the Y n 
series with the required frequency. Therefore, it would be suspected that the condition, 

P [Yi = 1, Y i+1 = 1 . . . Y l+k = 1] ~ Ck~ a , 

where a G (0,1) will produce LRD with H = 1 — a/2. To meet this requirement, the 
following strict condition is introduced for k > 0, 

oo 

Y,^ = Ck~ a , 

i=k 

where C is a constant. Note that there is no guarantee that this is a valid Markov chain - 
conditions for this will be given later. By setting k = 1 it is immediate that C = 1 — tiq. 
This gives, 

oo 

= (1 -7T )k- a k>0. 

i=k 

Subtracting the equation for k + 1 gives, 

7T fc = (l-7r )[A;- Q -(A; + l)- a ] k > 0. 

From @ for k then, 

TTo/fc = TTfc - TTfc+1 k > 0, 
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□ 

= 7r . Since all 



and therefore for k > 0, 



/* = [k- a - 2(k + I)"" + (k + 2)- a ] , (3) 

7T 



do 

—a 



E r< "- 2 E r "+E ! 

i=l i=2 i=3 



and also, since / = 1 - /i, 

/„ = l-i^ 

Most of the terms of the sum cancel leaving 

/o = 1 - — [1 " 2-1 • (4) 

7T 

The two equations, © and (jlj) form the model for LRD. The model is defined by two 
parameters 7r and a. The a parameter is related to the Hurst parameter as shown. The ir 
parameter is the equilibrium probability of state zero. Hence 1 — 7r is the sum of all other 
equilibrium probabilities and, therefore, the probability that any given Yj = 1. Therefore, 
the expectation value of Yj is given by, E [Yj] = 1 — 7r . It remains to be proved that the 
model does generate LRD with the required Hurst parameter and this is shown in the next 
section of the paper. 

It can be easily shown that this model meets the conditions for ergodicity established 
earlier. However, it should be noted that this model is not valid for every possible combina- 
tion of 7To and a. In particular, for values of tiq near zero then the term (1 — 7To) / 7To becomes 
large and values of fa from (jSJ) will be negative, a contradiction since the fi are probabilities. 
The fact that the model is invalid for some combinations of tt and a simply means that for 
practical experiments the model must be confined to the valid region. Rearranging equation 
(JH shows that for a, 7r G (0, 1) then f G (0, 1) if 

2° - 1 

*° > 2"+i_r 

and this defines a valid region for the model. 



III. THE ACF OF THE MARKOV MODEL 



It must now be shown that the model described in the previous section does produce 
traffic with a given mean and Hurst parameter. To recap, the model relies on a Markov 
chain of the form shown in Figure El and with transition probabilities given by (jSJ) and (@J). 
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The parameters of the model are ttq and a. Given some starting Xq e Z+, the Markov chain 
produces a time series {Xi : i G N} where Aj is the state of the chain at the zth iteration. 
This is used to produce another time series {Yi : i e N} where Y = if X( = and Yj = 1 
otherwise. (The time series Yi — 1 if JQ = and Yj = otherwise also produces a series 
with LRD and mean tt .) This series has LRD with mean 1 — tt and Hurst parameter 
H — 1 — a/2. That E [Yi] = 1 — ttq has already been shown. It remains to show that the 



ollows the form in ([!} and this requires a result due to Feller 



series has an ACF which 
and is based on Wang jl7[. 



A. Proof that the Chain Generates LRD 

The event e occurs whenever Xi = 0. It can easily be seen that the number of samples 
between successive occurrences of e is an independent and identically distributed variable 
and hence meets the definition in j^]. A "trial" in the terms of j^] is equivalent to one 
iteration of the Markov chain in this model. 

Definition 2. // e occurs at the zeroth trial then let the number of occurrences of e in n 
trials be N n . Let F(n) be the distribution function of the number of trials between one event 
e and the next. (Note that these definitions are those used by \s^)- 

If the event e has just occurred (at the zeroth trial) then the chain is in state 0. If the 
chain makes the transition to state i — 1 then the event e will occur in i steps. Therefore, 
the distribution function is given by 

n 

F(n) = (5) 
i=i 

The results from j^] and both assume that the distribution function F(n) obeys 

1 - F(n) ~ An 1 , (6) 

for some positive constant A and some 7. This will now be shown for the specified infinite 
chain. 

From equation (jHJ), 

n 00 00 

l-f(«) = l-E/'-i= E /< : »• 

i=l i=n+l i=n 
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Substituting /j from (jSJ): 

1-F(n)= 



\}~ a ~ 2(i + + (i + 2)-°] 

i 

^— ^ [n~ a - (n + l)""] 
1 -tt \ (n + l) Q -n a 



7T / (n + l) a n a 
1-tt \ (l + l/ n ) a -l 



7T / + l/n) a ' 

Expanding (1 + l/n) a using the binomial theorem gives 

(1 + l/n) a = l + a/n + 0(^ 2 ). 

Substituting this expression top and bottom gives 

, ,'1-tto^ 1 + a/n + 0{n~ 2 ) - 1 
1 — r (n) 



vr J n a {l + a/n + 0{n- 2 )) 
1 - tt \ n~ a {a/n + 0{n- 2 )) 



7T y (l + a/n + 0(n- 2 )) 
an 

where f(n) = 0(g(n)) for functions f(n) and ^(n) means that < Agin) for some 

positive constant A and all n > 0. This is the form required by equation (JBJ) with 7 = (1 + a) 
and A = q(l — 7r ) / 7r . 

From [35|, Theorem 10], given that the probability distribution satisifies 1—F(x) ~ Ac -7 , 
where A is a positive constant and 1 < 7 < 2 then 

2^4 

var (N n ) ~ r^^ 3 " 7 - 

1 n) (2- 7 )(3-7)/i 3 

In the case of the chain under investigation 7 = 1 + a, and A = a(l — tto)/tiq. Since the chain 

is ergodic, the mean recurrence time of state zero for the infinite chain is l/vr . Therefore, 

. 2a7Tg(l — 7T ) 2 -a /w\ 



From |12] (equations 2.26a and 2.26c) if 

var (iV n ) ~ Kn 2 - a , 
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for some positive constant K and some a G (0, 1) then the autocorrelation function is given 
by 

p{n) ~ Cn~ a , 

where C is some positive constant. This is the form required by (JTJ. 

IV. TESTS ON THE MARKOV MODEL 

In this section, two standard models for generating LRD are compared with the Markov 
model described in this paper. The computational performance of the algorithm is compared 
against other algorithms. 

A. Practical Implementation of the Model 

The only difficulty in modelling the situation on a computer comes in calculating X n+1 
when X n = 0. In this case, a random number generator and the transition probabilities 
fj must be used to find the next state. A naive approach to this would be to generate a 
random number r, uniformly distributed in (0, 1) and say that X n+ \ is the smallest i such 
that Y2]=o fi < r - This is fine for low values of i but as i increases then this procedure 
becomes inaccurate due to the finite precision arthmetic used by computers. The problem 
is that, as i increases the sum gets nearer to one but the get nearer to zero (since adding 
numbers approaching zero to numbers approaching one is likely to produce severe rounding 
error problems). Hence the errors in each stage of addition get larger. However, by the very 
nature of LRD, large values of i are very likely to come up. 

It can simply be shown that if X n = and < k < i < j, 

V[X n+1 e [i,j)\X n+1 e [k,oo)\ = 
r a - (i + i)- Q - (j + iy a + (j + 2)- Q ( 8 ) 

k~ a — (k + l)~ a 

Using this equation, Table |U shows a procedure for generating the sequence {X n : n 6 N} 
given some randomly chosen X . 
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(1) If X n > then X n+1 = X n - 1. Exit here. 

(2) Explicitly calculate P [-Xn+i > j] for values of j < N where N is some small 
integer. Use the procedure for the finite state model to find a value for X n+ \ if 
X n+1 < N. 

(3) Generate a new random number R in the range [0, 1]. 

(4) Calculate P [X n+1 e [N, 2N - l]\X n+1 € [N, oo]} from equation ©. If R is less 
than or equal to this probability then X n+ \ is in the required range. Otherwise 
go to step six. 

(5) If X n+ \ is in the required range then refine down by generating a new random 
number and seeing if X n+ \ is in the range [N, (3/2)N]. Continue refining by a 
binary search (with a new random number each time) until X n+ \ is found. Exit 
here. 

(6) Increase the value of N to 2N and go to step 3. 



TABLE I: A procedure for finding X n+ \ from X n in the infinite chain. 
B. Hurst Parameter Estimates 

Three generation mechanisms for LRD are compared, Fractional Gaussian Noise (FGN), 
iterated maps (it. map) and the Markov method developed in this paper. For each method, 
three different Hurst parameters are investigated and for each of these, three data realisations 
are created. For each realisation, one million points were generated (in the case of the iterated 
map and Markov method, each of those points was an aggregate of one hundred zeros and 
ones). The Hurst parameter was estimated using the previously discussed measurement 
techniques to check the match between theory and experiment. 

The three methods were implemented in the C programming language. On a 2GHz 
processor PC running Debian linux, to generate one million points took 55 seconds for the 
Markov method, 60 seconds for the iterated maps method and 6 seconds for the fractional 
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Gaussian noise method. However, it is debatable whether this is a fair comparison since 
the first two methods could be considered to be generating a hundred million points and 
aggregating into groups of one hundred. No C code to generate FARIMA based data was 
available and the R code available took 188 seconds to generate only a hundred thousand 
points — the run time did not seem to scale linearly and the test to generate a million points 
was stopped after several hours. 

It would naturally be expected that the FGN model is the easiest to estimate and this 
shows in the results in Table |H] All the estimators were relatively close to correct with 
the possible exception of the R/S plot on traffic with a Hurst parameter of 0.875 where the 
underestimate of H was quite severe. 

Estimates on the iterated chaotic map traffic were not so successful. The raw R/S plot 
proved inconsistent and had a hard time estimating higher hurst parameters. It should be 
noted, for example, that for H = 0.75 estimates varied from 0.678 to 0.828. The performance 
for H = 0.875 was similarly bad. The modified R/S parameter was better in that it was 
more stable across runs but tended to overestimate. Local Whittle and wavelets tended to 
overestimate the Hurst parameter. It should also be noted that the true result was regularly 
outside the 95% confidence intervals for the wavelet estimator. 

Estimates for the Markov based method were, in many ways, similar to the iterated map 
method. If anything, the results from the estimators are slightly closer to the theory and 
this is particularly notable for the wavelet and local Whittle case. The evidence provided 
by the estimators is hard to interpret. However, it can certainly be said that the results for 
the Markov method are as close as the results for the iterated map method. 

Generally, considering the estimators themselves, the R/S method seemed unreliable (and 
this agrees with theory which shows it to be a biased estimator with poor convergence). The 
local Whittle and wavelets methods which have better theoretical backing seem to have a 
better agreement with theory but it is worrying that the true Hurst parameter for the data 
lay outside 95% confidence for the wavelet estimator in many cases. 

V. CONCLUSIONS 

The method for generating LRD shown here is computationally efficient, extremely simple 
and produces a data stream with a given mean and Hurst paramter. The data stream can 
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Source H 


R/S Mod. Agg. 
R/S Var. 


Period- Local Wave- 
ogram Whit, lets 


FGN 0.625 
FGN 0.625 
FGN 0.625 


0.637 0.624 0.623 
0.632 0.624 0.622 
0.645 0.633 0.620 


0.626 0.639 0.635 
0.624 0.638 0.635 
0.622 0.638 0.635 


FGN 0.75 
FGN 0.75 
FGN 0.75 


0.728 0.738 0.741 
0.741 0.736 0.749 
0.694 0.719 0.741 


0.747 0.774 0.767 
0.755 0.776 0.769 
0.754 0.774 0.768 


FGN 0.875 
FGN 0.875 
FGN 0.875 


0.784 0.837 0.858 
0.750 0.823 0.850 
0.747 0.835 0.860 


0.877 0.908 0.897 
0.876 0.908 0.897 
0.876 0.908 0.898 


It. map 0.625 
It. map 0.625 
It. map 0.625 


0.635 0.590 0.604 
0.608 0.595 0.604 
0.637 0.594 0.610 


0.630 0.719 0.706 
0.627 0.716 0.703 
0.637 0.718 0.707 


It. map 0.75 
It. map 0.75 
It. map 0.75 


0.828 0.666 0.717 
0.725 0.650 0.712 
0.678 0.694 0.765 


0.746 0.813 0.800 
0.739 0.813 0.801 
0.768 0.814 0.803 


It. map 0.875 
It. map 0.875 
It. map 0.875 


0.703 0.779 0.851 
0.779 0.802 0.854 
0.846 0.817 0.861 


0.876 0.925 0.910 
0.877 0.924 0.910 
0.874 0.925 0.912 


Markov 0.625 
Markov 0.625 
Markov 0.625 


0.526 0.597 0.611 
0.593 0.645 0.700 
0.632 0.603 0.646 


0.621 0.703 0.691 
0.684 0.710 0.702 
0.650 0.707 0.698 


Markov 0.75 
Markov 0.75 
Markov 0.75 


0.663 0.684 0.744 
0.670 0.667 0.751 
0.671 0.671 0.724 


0.760 0.793 0.784 
0.759 0.793 0.783 
0.736 0.786 0.776 


Markov 0.875 
Markov 0.875 
Markov 0.875 


0.724 0.732 0.816 
0.757 0.754 0.830 
0.656 0.781 0.852 


0.848 0.884 0.873 
0.859 0.885 0.874 
0.866 0.885 0.875 



TABLE II: Hurst Parameter Estimates on Simulated Data. 
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be generated in an online manner (that is, the method can be started without knowing 
how many points must ultimately be generated unlike, for example, FGN). The method has 
been proved theoretically to generate LRD with the required parameters and this has been 
tested against a variety of known estimators for the Hurst parameter. It is interesting to 
see quite how badly certain estimators perform even against very standard LRD generation 
mechanisms. 

Compared with existing methods of generating LRD this procedure has a number of 
extremely attractive properties. It is computationally and mathematically extremely simple. 
While other models may have more flexibility for precisely representing the nature of the 
time series being simulated, it is hard to imagine a simpler model for generating LRD. It is 
hoped, therefore, that this model will be tractable analytically for further developments, for 
example, analysis of queuing performance of traffic generated by such a model. 

Acknowledgments 

The authors would like to thank David Arrowsmith and Martino Barenco for their con- 
tributions to this research. 



[1] J. Beran, Statistics For Long-Memory Processes (Chapman and Hall, 1994). 

[2] R. G. Clegg, Ph.D. thesis, Dept. of Math., Uni. of York., York. (2004), available online at: 

www . richardclegg . org/pubs/thesis . pdf . 
[3] W. E. Leland, M. S. Taqqu, W. Willinger, and D. V. Wilson, in Proc. ACM SIGCOMM, 

edited by D. P. Sidhu (San Francisco, California, 1993), pp. 183-193. 
[4] W. Willinger, M. S. Taqqu, R. Sherman, and D. V. Wilson, IEEE/ACM Trans, on Networking 

5, 71 (1997). 

[5] R. Morris and D. Lin, in Proc. IEEE INFOCOM (2000), pp. 360-366. 
[6] T. Oiura and R. Sawatari, Phys. Rev. E 58, 193 (1998). 
[7] S. Valverde and R. V. Sole, Physica A 289, 595 (2001). 

[8] M. Woolf, D. K. Arrowsmith, R. J. Mongragon, and J. M. Pitts, Phys. Rev. E 66, 046106 
(2002). 



17 



[9] A. Erramilli, O. Narayan, and W. Willinger, IEEE/ACM Trans, on Networking 4, 209 (1996). 
[10] I. Norros, Queueing Systems 16, 387 (1994). 

[11] Z. Sahinoglu and S. Tekinay, IEEE Communications Magazine January, 48 (1999). 
[12] A. L. Neidhardt and J. L. Wang, in Proceedings of the 1998 ACM SIGMETRICS joint inter- 
national conference on Measurement and Modeling of Computer Systems (1998), pp. 222-232. 
[13] B. B. Mandelbrot, Water Resources Research 7, 543 (1971). 
[14] R. B. Davies and D. S. Harte, Biometrika 74, 95 (1987). 
[15] V. Paxson, Computer Comm. Rev. 27, 5 (1997). 
[16] C. W. J. Granger and R. Joyeux, J. Time Ser. Anal. 1, 15 (1980). 
[17] X. J. Wang, Phys. Rev. A 40, 6647 (1989). 

[18] A. Erramilli, R. P. Singh, and P. Pruthi, Queueing Systems 20, 171 (1995). 

[19] H. Suetani and T. Horita, Phys. Rev. E 60, 422 (1999). 

[20] T. Horita and H. Suetani, Phys. Rev. E 65, 056217 (2002). 

[21] P. Ashwin, A. M. Rucklidge, and R. Sturman, Physica D 194, 30 (2004). 

[22] M. Giampieri and S. Isola, Disc, and Cont. Dyn. Sys. 12, 115 (2005). 

[23] M. Barenco and D. Arrowsmith, Dynamical Systems 19, 61 (2004). 

[24] R. H. Riedi, in Theory And Applications Of Long-Range Dependence, edited by P. Doukhan, 
G. Oppenheim, and M. S. Taqqu (Birkhauser, 2003), pp. 625-716. 

[25] R. H. Riedi, M. S. Crouse, V. J. Ribeiro, and R. G. Baraniuk, IEEE Special Issue On Infor- 
mation Theory 45 (April), 992 (1999). 

[26] M. S. Taqqu, Murad S. Taqqu's Homepage: 
math . bu . edu/ individual/mur ad/home . html. 

[27] M. Taqqu, V. Teverovsky, and W. Willinger, Fractals 3, 785 (1995). 

[28] M. S. Taqqu and V. Teverovsky, Stochastic Models 13, 723 (1997). 

[29] J.-M. Bardet, G. Lang, G. Oppenheim, A. Phillipe, S. Stoev, and M. S. Taqqu, in Theory and 
Applications of Long-Range Dependence, edited by P. Doukhan, G. Oppenheim, and M. S. 
Taqqu (Birkhauser, 2003), pp. 557-577. 

[30] B. B. Mandelbrot and J. R. Wallis, Water Resources Research 5, 228 (1969). 

[31] J. Geweke and S. Porter-Hudak, J. Time Ser. Anal. 4, 221 (1983). 

[32] R. Fox and M. S. Taqqu, The Annals of Statistics 14, 517 (1986). 

[33] P. M. Robinson, The Annals of Statistics 23, 1630 (1995). 

18 



[34] R. Artuso and G. Cristadoro, Physical Review Letters 90, 244101 (2003). 
[35] W. Feller, Trans, of the Amer. Math. Soc. 67, 94 (1949). 



19 



